Fast Transparent Cluster-Wide Paging
نویسندگان
چکیده
In a cluster with a very low-latency interconnect, the remote memory of nodes can serve as a storage that is faster than local disk but slower than local memory. In this paper, we address the problem of transparently utilizing this cluster-wide pool of unused memory as a low-latency paging device. Such a transparent remote memory paging system can enable large-memory applications to benefit from cluster-wide memory resources without compromising application performance while obviating the need for very large local memory or modifications to legacy applications. There was considerable interest in this subject in the 1990’s, which saw a number of remote paging systems built over specialized interconnects such as Myrinet and ATM switches. However, in recent years, this subject has not received the research attention it deserves. With the advent of highly affordable commodity gigabit Ethernet switches in the market, we examine the feasibility and advantages of using gigabit Ethernet to perform fully distributed remote memory paging. We leverage upon our earlier experiences with the Anemone project [11] – a centralized remote paging system – to develop a fully distributed, transparent, Linux-based alternative over gigabit Ethernet to avoid the disk I/O bottleneck. A self-managing pseudo block device module resides in every participating workstation in the cluster and is responsible for a multitude of functions. This module avoids the entire IP stack and user-space overheads, manages the available cluster-wide memory as well as local disk swap devices, eliminates centralized decision making, is completely transparent to the kernel, dynamically adapts to the availability of remote memory, and fully exploits the bandwidth-delay product of the underlying interconnect. Preliminary performance results indicate that the distributed paging system design can yield sequential program execution speedups of up to a factor of 6 when compared to disk-based paging (versus 3 in our in our cenrtalized design) and pagefault latencies in the order of 210 microseconds.
منابع مشابه
Distributed Anemone: Transparent Low-Latency Access to Remote Memory
Performance of large memory applications degrades rapidly once the system hits the physical memory limit and starts paging to local disk. We present the design, implementation and evaluation of Distributed Anemone (Adaptive Network Memory Engine) – a lightweight and distributed system that pools together the collective memory resources of multiple machines across a gigabit Ethernet LAN. Anemone...
متن کاملA Transparent Remote Paging Model for Virtual Machines
In virtual machine systems, with the increase in the number of VMs and the demands of the applications, the main memory is becoming a bottleneck for the application performance. To improve paging performance and alleviating thrashing behavior for memory-intensive or I/O-intensive virtual machine workloads, we proposed hypervisor based remote paging, which allows a virtual machine to transparent...
متن کاملExploiting Remote Memory in InfiniBand Clusters using a High Performance Network Block Device (HPBD)
Traditionally, remote memory accesses in cluster systems are very expensive operations, which perform 20-100 times slower than local memory accesses. Modern RDMA capable networks such as InfiniBand and Quadrics provide low latency of a few microseconds and high bandwidth of up to 10 Gbps. This has made remote memory much closer to the local memory system. Using remote idle memory to enhance loc...
متن کاملTerminal Paging with Fast Speed , Low Cost and High Quality of Service 3 Table 1
Dynamic location management for a mobile terminal is a key problem in wireless communication networks. There are three fundamental performance and cost issues in terminal paging. The first issue is the speed (i.e., time delay) of terminal paging. The second issue is the cost (i.e., the number of cells paged) of terminal paging. The third issue is the quality of service (QoS), i.e., the probabil...
متن کاملIDMP-based FAST HANDOFFS AND PAGING IN IP-BASED CELLULAR NETWORKS
In this paper, we consider the use of the recently proposed Intra-Domain Mobility Management Protocol (IDMP) in and generation (3/4G) wireless cellular networks to reduce the latency of intradomain location updates and the mobility signaling traffic. We first present enhancements to basic IDMP that provide fast intra-domain handoffs by using a duration-limited, proactive packet ‘multicasting’ s...
متن کامل